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METHOD OF BUFFER MANAGEMENT AND TASK SCHEDULING FOR 
TWO-DIMENSIONAL DATA TRANSFORMING 

5 CROSS-REFERENCE TO RELATED APPLICATION 

This application claims the priority benefit of provisional application serial no. 
60/217,825, filed July 12, 2000, the full disclosure of which is incorporated herein by 
reference. 

/ 0 BACKGROUND OF THE INVENTION 

Field of Invention 

The present invention relates to a method of buffer management and task 
scheduling. More particularly, the present invention relates to a method of buffer 
management and task scheduling for two-dimensional data transforming. 
1 5 Description of Related Art 

Two-dimensional Discrete Fourier Transforms, such as Discrete Cosine 
Transform (DCT), are widely used in image processing, digital signal processing, digital 
communications, etc. Due to its intrinsic properties, two-dimensional transform 
operation requires significant amount of memory to store data. One good example is a 
20 two-dimensional DCT (2D-DCT) operated on a two-dimensional image. As illustrated 
in FIG. 1, an image 10 is scanned line-by-line from the left-top corner to the right- 
bottom corner. In order to perform 2D-DCT with 6x6 block size 12, at least five lines 
of data 16 have to be stored. Furthermore, new input data (not shown) keeps coming in 
while old data 14 are being processed. Since the input data follow a line-by-line pattern 
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and the old data 14 are processed in a block-by-block pattern, there is a conflict in 
memory space. 

Traditionally, there are two solutions to solve this problem. The first solution is 
to use two six-line buffers, one for reading out data and the other for writing data in, and 
5 then swapping the buffers every six lines. However, this requires double the memory 
space than required in theory. The second solution is to process the 2D-DCT very 
quickly and to clean out the memory space before the read-write conflict occurs. 
Though, this causes significant processing bursts while receiving the "last line" 18 of 
the 6x6 blocks 12. It does not only require very extensive computations for such 

/ 0 processing bursts, but also waste the computing resources while staying idle and waiting 
for all necessary data to be processed. 

The invention provides a new solution to the problem stated above. The 
invention uses the minimum space required in theory and distributes the process loading 
evenly in time domain. It eliminates the trade-off between memory size and computing 

75 power, which has been recognized as a serious dilemma in 2D Discrete Fourier 
Transforms. The invention also presents a practical implementation of these presented 
ideas which is very simple and, therefore, consumes much less power than traditional 
designs do. 



20 SUMMARY OF THE INVENTION 

The invention provides a method of buffer management and task scheduling for 
two-dimensional data transforming, which transforms data in a buffer sequentially by 
blocks, wherein a block comprises a specific size having plurality of row portions and a 
plurality of column portions. The method comprises the steps of initializing a write 
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logic address and a read logic address to zero. Then, old data is read out in a block-by- 
block pattern. While reading out data, new data is written in immediately in a line-by- 
line pattern in the block using a first mapping scheme. After the whole block is read 
out, the block is moved to process another column portion. Then old data in the block is 
5 again read out in a block-by-block pattern while new data is again written in 
immediately in a line-by-line pattern in the block using a second mapping scheme. 
After the whole block is read out again, the block is moved to process another column 
portion. These read and write steps with the interleaving of the first and second 
mapping schemes are repeated until the whole data in the buffer is processed. 

10 Prior to the read and write steps, the block is completely filled with data using a 

second mapping scheme. After the block is filled, the write logic address is 
incremented by 1. Accordingly, the write logic address is incremented by 1 per buffer 
write, and the read logic address is incremented by 1 per buffer read as well. 

According to the invention, the first and second mapping scheme translates 

/ 5 logical addresses to physical addresses, which are really used to access the buffer. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary, and are intended to provide further 
explanation of the invention as claimed. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings are included to provide a further understanding of 
the invention, and are incorporated in and constitute a part of this specification. The 
drawings illustrate embodiments of the invention and, together with the description, 
serve to explain the principles of the invention. In the drawings, 
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FIG. 1 is a conventional method of buffer management and task scheduling for 
two-dimensional data transforming. 

FIG. 2 is a logical addressing method of buffer management and task scheduling 
for two-dimensional data transforming in a preferred embodiment according to the 
5 invention. 

FIG. 3 is a flow chart overview of the operational procedure of buffer 
management and task scheduling for two-dimensional data transforming in a preferred 
embodiment according to the invention. 

7 0 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Reference will now be made in detail to the present preferred embodiments of 
the invention, examples of which are illustrated in the accompanying drawings. 
Wherever possible, the same reference numbers are used in the drawings and the 
description to refer to the same or like parts. 

75 Before discussing the embodiments of the invention, a general and qualitative 

description of the invention should prove useful. The basic idea behind the invention is 
to keep the data input and output rates the same. In other words, in average, one datum 
from processing is taken out from the memory while writing one new datum into the 
same memory. By doing so, the memory space can be fully utilized and the process 

20 loading can be evenly distributed. This may sound straightforward, however in reality, 
what just have been described is very hard to implement, since the input and output data 
follow different patterns. The 2D data is scanned in a line-by-line pattern, but they are 
processed in a block-by-block pattern. If new data is simply filled in the memory 
locations that are just emptied, the memory read-out pattern needs to be changed to 
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fulfill the original block-by-block output requirement. This means that the memory is 
being emptied in a different manner. Therefore, the pattern for writing the following 
data in the memory changes. In consequence, the memory read-out pattern for the next 
data changes again. The situation repeats itself. After a few iterations, the patterns for 
5 writing and reading the memory look almost random. Unless a huge record to track all 
the previous input and output data is available, it is very hard, if not impossible, to know 
where to write the new data and read out the necessary data for processing. Keeping 
such a record is not practical because it will require much more memory space than 
needed in storing the original data. In this invention, by re-arranging the order of output 

1 0 blocks, robust addressing schemes are created for input and output sequences to resolve 
the problem discussed above. 

The operations of the novel buffer management and task scheduling for 2D-DCT 
is best described by an example. FIG. 2 is a logical addressing scheme of a buffer 
management and task scheduling for two-dimensional data transforming in a preferred 

/ 5 embodiment according to the invention. 

In FIG. 2, assuming a 2D-DCT with 8x8 block size 22 is performed on an image 
20 of 512 pixels per line. A 9-bit binary word 23 is used to address the pixels on each 
line, and a 3 -bit binary word 25 is used to address the minimal 8 lines required for the 
8x8 block size. 

20 The 9-bit pixel address 23 is separated into 3-bit "dot", 3-bit "block", and 3-bit 

"sector" addresses. Associating this with the 3-bit line address 25, {3-bit line address, 
9-bit pixel address} is equal to {3-bit line address, 3-bit sector address, 3-bit block 
address, 3-bit dot address}. Two logical addresses of this kind is obtained for buffer 
write and buffer read respectively. These two logical addresses are increased by one 
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from zero per access. In addition, two types of address mappings are defined for buffer 
write and buffer read. These two mappings translate logical addresses to physical 
addresses, which are really used to access the memory of the buffer. So, a write logical 
address, a write physical address, a read logical address, a read physical address, and 
5 two address mapping schemes is obtained. The two address mapping schemes are listed 
as follows: 

Mapping Scheme A; Physical Address = {Sector Address, Line Address, Block 
Address, Dot Address} 

Mapping Scheme B: Physical Address = {Sector Address, Block Address, Line 
1 0 Address, Dot Address} 

FIG. 3 is a flow chart overview of the operational procedure of buffer 
management and task scheduling for two-dimensional data transforming in a preferred 
embodiment according to the invention. In step 30, for the first time usage, the memory 
buffer is filled up using Mapping Scheme A for buffer writes. The write logic address 
/ 5 is increased by one from zero per buffer write. 

Next, at step 31, when the buffer is full, it is ready to perform an 8x8-block size 
2D-DCT operation. The read logic address is increased by one from zero per read 
write. Data of 8x8 block are read out by using Mapping Scheme B for buffer read. This 
mapping automatically guarantees the output sequence required by 2D-DCT operations. 
20 Then, at step 32, while reading out data for 2D-DCT, the memory buffer is still 

receiving input data. The "one out, one in" method is followed to maximize the buffer 
efficiency. The input data by using Mapping Scheme B for buffer writes is then 
recorded. 
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Further, at step 33, after the whole buffer is read out for processing and is 
replaced by new data, it is ready to process another 8 lines stored in the buffer. It is 
essential to make sure the output sequence is correct for 2D-DCT operations. Data is 
read out by using Mapping Scheme A for buffer reads this time. This mapping 
5 automatically guarantees the output sequence required by 2D-DCT operations. 

Moreover, at step 34, as in step 32, the memory buffer is still receiving input 
data while reading out data for the 2D-DCT. The input data is recorded by using 
Mapping Scheme A for buffer writes. 

Finally, after the whole buffer is read out for processing and is replaced by new 
10 data, Mapping Scheme B is switched for both buffer reads and writes. As from step 31 
to step 34, the two mapping schemes are interleaved to guarantee correct output 
sequences while the buffer is kept full all the time. 

By interleaving the simple address mapping schemes, the dilemma in 2D data 
transforms can be resolved. In addition, the output flow is continuous after the first 
75 filling. The task scheduling automatically follows the data output throughout. The 
process loading is smoothed out without loading bursts. 

In the invention, a pre-buffer can be used to store data. This pre-buffer can be 
replaced by a "staging buffer" which stores temporary data between the "first one- 
dimensional" and "second one-dimensional" operations in 2D-DCT processes. The 
20 same concept can be applied to the staging buffer. In addition, the mapping schemes 
mentioned above can be modified or combined with more address manipulations to 
accommodate different operational requirements. 

The invention achieves maximum buffer efficiency without trading off loading 
requirement. It does need a complicated address recording scheme to fulfill the output 
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sequences required by 2D-DCT operations. The 2D-DCT operations can also be 
scheduled accordingly. No existing methods can resolve the same dilemma without 
increasing the system complexity significantly. 

It will be apparent to those skilled in the art that various modifications and 
5 variations can be made to the structure of the present invention without departing from 
the scope or spirit of the invention. In view of the foregoing, it is intended that the 
present invention cover modifications and variations of this invention provided they fall 
within the scope of the following claims and their equivalents. 
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